In this paper, we revisit the problem of entity resolution and propose a novel, logical framework, LACE, which mixes declarative and procedural elements to achieve a number of desirable properties. Our approach is fundamentally declarative in nature: it utilizes hard and soft rules to specify conditions under which pairs of entity references must or may be merged, together with denial constraints that enforce consistency of the resulting instance. Importantly, however, rule bodies are evaluated on the instance resulting from applying the already 'derived' merges. It is the dynamic nature of our semantics that enables us to capture collective entity resolution scenarios, where merges can trigger further merges, while at the same time ensuring that every merge can be justified. As the denial constraints restrict which merges can be performed together, we obtain a space of (maximal) solutions, from which we can naturally define notions of certain and possible merges and query answers. We explore the computational properties of our framework and determine the precise computational complexity of the relevant decision problems. Furthermore, as a first step towards implementing our approach, we demonstrate how we can encode the various reasoning tasks using answer set programming.

LACE: A Logical Approach to Collective Entity Resolution / Bienvenu, M.; Cima, Gianluca; Gutierrez-Basulto, V.. - (2022), pp. 379-391. (Intervento presentato al convegno ACM SIGMOD-SIGACT-SIGART Conference on Principles of Database Systems tenutosi a Philadelphia; Usa) [10.1145/3517804.3526233].

LACE: A Logical Approach to Collective Entity Resolution

Cima Gianluca
;
2022

Abstract

In this paper, we revisit the problem of entity resolution and propose a novel, logical framework, LACE, which mixes declarative and procedural elements to achieve a number of desirable properties. Our approach is fundamentally declarative in nature: it utilizes hard and soft rules to specify conditions under which pairs of entity references must or may be merged, together with denial constraints that enforce consistency of the resulting instance. Importantly, however, rule bodies are evaluated on the instance resulting from applying the already 'derived' merges. It is the dynamic nature of our semantics that enables us to capture collective entity resolution scenarios, where merges can trigger further merges, while at the same time ensuring that every merge can be justified. As the denial constraints restrict which merges can be performed together, we obtain a space of (maximal) solutions, from which we can naturally define notions of certain and possible merges and query answers. We explore the computational properties of our framework and determine the precise computational complexity of the relevant decision problems. Furthermore, as a first step towards implementing our approach, we demonstrate how we can encode the various reasoning tasks using answer set programming.
2022
ACM SIGMOD-SIGACT-SIGART Conference on Principles of Database Systems
answer set programming; collective entity resolution; complexity analysis; declarative framework; logical constraints
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
LACE: A Logical Approach to Collective Entity Resolution / Bienvenu, M.; Cima, Gianluca; Gutierrez-Basulto, V.. - (2022), pp. 379-391. (Intervento presentato al convegno ACM SIGMOD-SIGACT-SIGART Conference on Principles of Database Systems tenutosi a Philadelphia; Usa) [10.1145/3517804.3526233].
File allegati a questo prodotto
File Dimensione Formato  
Bienvenu_LACE_2022.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 1.15 MB
Formato Adobe PDF
1.15 MB Adobe PDF   Contatta l'autore
Bienvenu_postprint_LACE_2022.pdf

accesso aperto

Note: https://doi.org/10.1145/3517804.3526233
Tipologia: Documento in Post-print (versione successiva alla peer review e accettata per la pubblicazione)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 631.93 kB
Formato Adobe PDF
631.93 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1678182
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 0
social impact